A simplified version of the ITU algorithm for objective measurement of speech codec quality
Abstract
ITU-T Recommendation P.861 describes an objective speech quality assessment algorithm for speech codecs [1]. This algorithm transforms codec input and output speech signals into a perceptual domain, compares them, and generates a noise disturbance value, which can be used to estimate perceived speech quality. The performance of this algorithm can be judged by the correlation between those estimates and actual listener opinions from formal subjective listening tests. We show that significant simplifications can be made to the P.861 algorithm with minimal effect on its performance. Specifically, for the portions of the algorithm under study here, 64% of the floating point operations can be eliminated with only a 3.5% decrease in average correlation to listener opinions. The resulting simplified algorithm may offer a practical new objective function to drive parameter selections, excitation searches, and bit allocations in speech and audio coders.
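The evaluation criterion mentioned in the abstract, correlation between objective estimates and listener opinions, can be illustrated with a minimal sketch. It assumes a plain Pearson correlation coefficient, which the abstract does not specify, and the function name `pearson_correlation` and all numeric values are hypothetical, made up for illustration rather than taken from the paper.

```python
import math


def pearson_correlation(xs, ys):
    """Return the Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length, at least 2 items")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations and sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical data (NOT from the paper): one noise disturbance value and one
# mean listener opinion score per codec condition.
disturbance = [0.5, 1.0, 1.5, 2.0, 2.5]
listener_scores = [4.5, 4.0, 3.5, 3.0, 2.5]

r = pearson_correlation(disturbance, listener_scores)
print(f"correlation = {r:.3f}")
```

In this toy data, higher disturbance goes with lower listener scores, so the coefficient is negative. A comparison of two objective measures would look at the magnitude of the correlation each achieves against the same listener scores.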
Related resources
Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs
Previous objective speech quality assessment models, such as bark spectral distortion (BSD), the perceptual speech quality measure (PSQM), and measuring normalizing blocks (MNB), have been found to be suitable for assessing only a limited range of distortions. A new model has therefore been developed for use across a wider range of network conditions, including analogue connections, codecs, pac...
A voice activity detector for the ITU-T 8 kbit/s speech coding standard G.729
Voice Activity Detectors (VAD's) are widely used in speech technology applications where available transmission or storage capacity is limited (e.g. mobile, DCME, etc.) and must be utilised with maximum economy. Modern day digital speech coding algorithms can provide toll quality speech at bit-rates as low as 8kbit/s (e.g. ITU-T G.729) and the use of a VAD can achieve further economy in average...
Performance evaluation of objective quality measures for coded speech
Low bit-rate speech coding is a key technology for multimedia telecommunications. A number of coding algorithms have been developed for various applications. When optimizing or characterizing a codec, for example, one needs to evaluate its performance based on a subjective quality assessment, which is time-consuming and expensive. Therefore, objective quality measures that correlate well with s...
Considering Bluetooth’s Subband Codec (SBC) for Wideband Speech and Audio on the Internet
The Bluetooth Special Interest Group (SIG) has standardized the subband coding (SBC) audio codec to connect headphones via wireless Bluetooth links. SBC compresses audio at high fidelity while having an ultra-low algorithm delay. To make SBC suitable for the Internet, we extend it by using a time and packet loss concealment (PLC) algorithm that is based on ITU’s G.711 Appendix I. The design is ...
The adaptive multirate wideband speech codec (AMR-WB)
This paper describes the Adaptive Multirate Wideband (AMR-WB) speech codec recently selected by the Third Generation Partnership Project (3GPP) for GSM and the third generation mobile communication WCDMA system for providing wideband speech services. The AMR-WB speech codec algorithm was selected in December 2000 and the corresponding specifications were approved in March 2001. The AMR-WB codec...